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ABSTRACT 

We present a modified version of the friends-of-friends (FOF) structure finding algorithm 
(Huchra & Geller 1982), designed specifically to locate groups or clusters of galaxies in pho- 
tometric redshift datasets. The main objective of this paper is to show that this extended 
friends-of-friends (hereafter EXT-FOF) algorithm yields results almost identical to the orig- 
inal FOF, if applied to a spectroscopic redshift dataset, and a rather conservative catalogue 
of structures, in case of a dataset with simulated photometric redshifts. Theref ore, we create 
group catalogues for the first Center for Astrophysics Redshift Survey (CFA 1 : [ puchra et all 
Il983h . as well as for the Las Campanas Redshift Survey (LCRS; Shectman et al. 1996), both 
of which being spectroscopic surveys, using FOF-algorithms. We then apply our new algo- 
rithm to said surveys and compare the resulting structure catalogue. Furthermore, we bestow 
simulated photometric redshifts on the LCRS galaxies, and use the EXT-FOF to detect struc- 
tures, that we compare in size and composition to the ones found in the original, spectroscopic 
dataset. We will show that the properties of this modified algorithm are well understood and 
that it is suited for finding structures in photometric datasets. This is the first paper in a se- 
ries of papers, dealing with the application of our new cluster finding algorithm to various 
photometric redshift galaxy surveys. 
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1 INTRODUCTION 

Optical and near-IR galaxy surveys, and especially the groups and 
clusters of galaxies found therein, have been playing an impor- 
tant role in our understanding of structure forma ti on an d cosmol- 
ogy. A s ha s been shown by IPress & SchechteJ il974) . IPeeblesI 
jl993h . and lEkeetall {l99g), among others, the evolution of the 
mass-function of groups and clusters of galaxies is highly sen- 
sitive to cosmological parameters, the type of dark matter and 
the biasing of dark against baryonic matter. Thus, comparison of 
the mass-function predicted by semi-analytic structure formation 
mode ls, like the Press-Schechter formalism ( Press & Schechter 
1 1974 ^). or N-body simulations with the observed mass-function 
yields strong constraints on structure formation scenarios and the 
cosmological paradigm jB ahcall & Cen||l992|: |Ueda e t alJll993l : 
|jing&Fang||l994l: iBahcall et alJll997tlBode et alJbOOlh .Finding 
large-scale structures in galaxy surveys is also an important step 
in examining the evolution of galactic properties, like the lumi- 
nosity function, in high- and low-density enviro nments iTrenthaml 
199a Ide Propris et alJll999l: Drory et al. 2001a: iFried et all200 it 
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nation techniques jBaumI 


19621: lKodll985: Brunner et alJll999l: 


iFemandez-Soto et alJl99S 


;lBem'tezl200Cl:lBender et alj200ll). ac- 



proximate redshifts became available for all galaxies in a photo- 
metric multi-band survey, without having to do time-consuming 
spectroscopic follow-up observations. This type of galaxy sur- 
vey is increasing in popularity. The list of surveys using pho- 
tometric redshift determin ation includes the Hubble Deep Fields 
(HDF) North and South ( Williams et alll99dl200m . the Las Cam- 
panas Infrared Survey (LCIR; Marzk e et all 19991 McCarthy et al. 
2001a, b), the Calar Alto Deep Imaging Survey (CADIS; Wolf 
et al. 2001a), the Classifying Objects by Medium-Band Observa- 
tions Survey (COMBO-17; Wolf et al. 2001b), the Munich Near- 
IR Cluster Survey (MUNICS; Drory et al. 2001b), and the Fors 
Deep Field (FDF; Heidt et al. 2003). However, photometrically de- 
termined redshifts possess larger errors than spectroscopically de- 
termined ones. The accuracy of the photometric redshift determina- 
tion is typically worse than the spectroscopic one by more than two 
orders of magnitude. Thus, the photometric redshift errors corre- 
spond to roughly 50 times the typical velocity dispersion of bound 
structures, like galaxy groups or clusters. 

The actual identification of groups or clusters of galaxies, ei- 
ther in an observed or simulated catalogue, is a non-trivial enter- 
prise. A reliable detection of large-scale structures in an observed 
galaxy catalogue is even more complicated if the galaxies have only 
photometric redshifts or, even worse, none at all. Up to now, many 
different techniques have been created for the purpose of structure 
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finding. All of them have their advantages and disadvantages and 
are constructed to fit the specific features of their surveys, like the 
wavelength of the observed radiation, or whether redshifts were de- 
termined for all objects, and if so, what type of redshift determina- 
tion was used. Giving a complete list of all types of structure finding 
techniques applied to optical and near-IR data, and their individual 
implementations would exceed the scope of this paper. However, 
we give a brief overview over the more important ones and their 
working principles, to better place our work in the context of struc- 
ture finding. 



1958|) . foll owed by |Ab ell et al 



field was 


achieved by 
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(1989), 


Lidman & Peterson 



in-cells method looks for surface density enhancements in circu- 
lar apertures of fixe d physical rad ius. A modification to this ap- 



proach was used by Zwickv et al. (1968'), 'Tu rner & GottI il976h . 
[Couch et al. (1991), and Plionis et al. (1991), who searched for sur- 
face density enhancements by method of isopleth-contours. An- 
other rather famous me thod is the hierarchical clusteri ng technique 
jMatemdll978t lTull^^[l9 80: Hodd & Materne 198^ ITu11\< [19871: 
lGourgoulhor^tan ^9?). It identifies structures on the basis of 
optimising typical cluster or group properties, such as spatial sep- 
aration or luminosity density, by combining individual galaxies 
into groups, with the advantage of delivering a list of local con- 
centrations within the groups, and superstructures at given values 
of the property that is to be optimised. One of the more recently 
developed methods is the matched or generalised likelihood filter 
JPostman et aJl99dlSchuecker & BoehringeJl998tlKeDner et alJ 
ll999tlLoboetall2000l) . On the basis of Bayesian probability the- 
ory, the galaxy catalogue is convolved with a set of models, the 
filter, describing typical cluster properties, like the radial profile or 
the luminosity function, thus determining the likelihood of having 
clusters with given positions and richnesses in the input dataset. 
Voronoi tessellation has also been used as a means of detecting 
large-scale structures ( Ramella et al. 2001 ; Kim et al. 2002). In this 
case, a unique plane or volume partition of the galaxy distribu- 
tion is done, yielding a measure for the local densities, thus en- 
abling identification of groups or clusters as significantly over- 
dense regions. Techniques looking for spherical density enha nce- 
ments have been used by Lace v & Cole ( 1994) and Cole & La cevI 
il996h . S ome more exceptional approaches are t he wavelet decom- 
position ^Sle zak et alJll990|-. IParedes et aljfl9951 the HOP algo- 
rithm (Eisenst e i^^^u3 ll998h . the cluster red sequenc e approach 
ioiSlders & Yed200d) . or the cut-and-enhance method iGoto et all 
l2002h . 

Probably the most extensively used structure finding al- 
gorithm is the friends-of-friends technique or percolation algo- 
rithm. This method looks for number density enhancements in 
three dimensions by searching for galaxy pairs, that are closer 
to one another than a given cut-off separation. Consequently, the 
friends-of-friends does not suffer a bias as far as the radial pro- 
file, or luminosity function are concerned, as is for example the 
case for the matched filter technique. Up to now, the friends-of- 
friends (FOF) appr oach has only been applied to spectroscopic 
redshift surve ys tHuchra & Ge llei"l982t: iGeller & Huc hriJll983t 
Ramella e^lJ^89^^997£toasarti-Battistoni 1998; Giuricii^^lJ 
Merchan et alj 12000 * ' ' " 



2000; 



[Tucker etal. 2000; Ramella et al. 



2002) , or N-body simulations tPavis et al.,19 85; Efstathi ou et al. 
19881 iLacev & Cold Il994 |CokT^e31l996; Vala geas etlilj 



200Cft . In the first case, the spectroscopic redshift is a combination 
of the Hubble expansion and the peculiar velocity of the galaxy, 
thus being a very good estimate of the true distance to the galaxy. 



In the second case, the exact position in three-dimensional space is 
known. Applying this algorithm to a galaxy catalogue with photo- 
metric redshifts constitutes a problem, due to the less than perfect 
distance information. If the redshift errors are not taken into ac- 
count, the resulting structure catalogue will contain highly unphys- 
ical groups and clusters. If the errors are taken into account, yet 
the algorithm itself is not changed, the procedure yields structures, 
that are extremely elongated in redshift (Az ~ 1). Thus it proves 
necessary to develop a new type of FOF algorithm, the extended 
friends-of-friends, for photometric redshift datasets, that includes 
the redshift error information and at the same time cuts down on 
unrealistically elongated structures. 

Sectio nlUgives a brief overview of the original FOF technique 
of lHuchra & Geller (1982) and introduces the EXT-FOF algorithm. 
The following two sections prove, that in case of galaxy catalogues 
with spectroscopically determined redshifts, both algorithms yield 
almost identical group catalogues. In Section |3| both friends-of- 
friends algorithms are applie d to the spectroscopic C FAl survey, 
following the recipe given bv lGeller & Huchr j Jl983h . The result- 
ing group catalogues are compared on an object-to-object basis and 
the deviations are explained. The same is done for the LCRS in Sec- 
tion|4| following the approach of Tucker et al. ( 2000). In order to 
show that our EXT-FOF technique provides a conservative group 
and cluster catalogue for photometric galaxy surveys, we simulate 
photometric redshifts for the LCRS galaxies in Section|5| We then 
apply our EXT-FOF to this new dataset and compare the detected 
structures to the ones found with the EXT-FOF algorithm in the 
original, spectroscopic dataset. 

This paper is the first in a series, dealing with the search for 
large-scale structures, like groups and clusters of galaxies, in pho- 
tometric redshift datasets. In the next paper, we will apply our new 
cluster finding technique to MUNICS and test the plausibility of the 
found structures, using colour-magnitude diagrams, Voronoi tessel- 
lations, and a likelihood approach. The same will be performed for 
the FDF galaxy catalogue in our third paper. 



2 FRIENDS-OF-FRIENDS (FOF) ALGORITHMS 

The friends-of- f riends algorithm, that was invented by 
'Huchra & Gelled il982h . is one of the most frequently used 
cluster finding techniques. It was designed to find number overden- 
sities in spectroscopic galaxy surveys and has also been modified to 
look for structures in simulated galaxy datasets. The FOF approach 
is very straightforward and has no need for complicated cluster 
models. It looks for galaxy pairs, that are closer to one another 
than a given cut-off separation. One of the resulting advantages is 
its insensitivity to cluster shape. However, like most of the other 
structure finding techniques, it has the drawback of being rather 
slow in its application. Changing the cut-off parameters requires 
a new run of the procedure, which makes the optimisation of the 
cut-off values time-consuming. 



2.1 The Huchra & Geller friends-of-friends algorithm 

The original FOF algorithm was created to look for groups and 
clusters of galaxies in the magnitude-limited CFAl redshift sur- 
vey jil uchra & Gellei 1982; Huchra et al. 1983; Geller & Hucl^ 
Il983l) . It makes use of three basic pieces of information, the posi- 
tions of the galaxies in right ascension and declination, and their 
spectroscopic redshifts. 

A slightly modified version of the algorithm's flow chart given 
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Figure 1. Flow chart for the Huchra & GeUer friends-of-friends algorithm 



bv lHuchra& Gelled il982h is shown in Fig. Q First, an object i 
from the galaxy catalogue, that has not yet been assigned to any 
group, is chosen. Then friends j of that galaxy are searched for. 
They have to fulfill two criteria. Their projected separation Dij 
from the first galaxy has to be less than a critical value Dl'- 



2 sin 



V_ 

Ho 



< Dl 



(1) 



where Oij is the angular separation between galaxies i and j. Ho is 
the Hubble constant, and V is the mean velocity of the galaxy pair: 

Vi + Vj 



Vi and Vj being the velocities of the individual galaxies. Further- 
more, their separation Vij in velocity-space has to be less than a 
second critical value Vl '■ 



Vij ■- \vi -Vj \ < Vl 



(3) 



If there are no friends for galaxy i, it is called an isolated object and 
removed from the catalogue of possible cluster members. If friends 
are found for galaxy i, a list for group-candidate k is initiated, con- 
taining galaxy i and all of its friends j, that fulfill eqs. Q and J^. 
The search for friends is extended to the surroundings of the galax- 
ies j. All friends found are once more added to the group-candidate 
k. This is done until no further friends of friends can be detected. 
The group-candidate k is called a real group, if the number of 
its members exceeds the limit Nmin'- 



Nk > iV„ 



(4) 



V = 



(2) 



If this is not the case, all members of said group-candidate are re- 
moved from the catalogue. The next galaxy from the catalogue is 
then taken and its surroundings are searched for friends. 

As can be seen easily, this algorithm is commutative and yields 
reproducible results, which are very important features for any 
structure finding technique. 

The choice for the linking parameters Dl and Vl depends on 
the properties of the input galaxy dataset. In principle, both param- 
eters can be chosen to remain either fixed or to vary with redshift. 
The minimum group size limit Nmin is usually set to 3. Thus only 
galaxy pairs are excluded from the final group catalogue. 



2.2 The extended friends-of-friends algorithm 

Due to the relatively large errors in the photometric redshifts, the 
friends-of-friends algorithm cannot be used for this type of dataset 
without some modifications. If the photometric redshift errors are 
not taken into account, a large fraction of the resulting groups and 
clusters are not physically bound objects. While the linking velocity 
Vl of the original FOF usually corresponds roughly to typical clus- 
ter velocity dispersions, photometric redshift errors are normally 
larger by a factor of 50. This implies that the deviation between the 
measured and the true distance to a galaxy might be off by about 50 
times the linking velocity. Identification of these galaxies as group 
members is impossible with the original FOF technique. On the 
other hand, galaxies, whose true positions on the velocity axis are 
deviant by about 100 Vl, could be combined into groups. Sim- 
ply including the redshift errors of the individual galaxies into the 
linking criteria does not solve this dilemma either. The resulting 
structures would get unreasonably extended in redshift. Allowing 
for the necessary large value in the linking velocity would enable 
huge chains of galaxies to be linked together by the FOF algo- 
rithm. Thus, a modified version of the FOF algorithm, the extended 
friends-of-friends or EXT-FOF, becomes necessary for photometric 
redshift galaxy surveys. 

The EXT-FOF technique utilises the same informations as the 
original FOF, i.e. right ascensions, declinations and, in this case, 
photometric redshifts of the galaxies. Furthermore the individual 
redshift error of each galaxy is taken into account. 

The algorithm itself consists of three parts: In its inner loop, 
the technique is almost identical to the Huchra & Geller method, 
with slightly altered linking conditions. The search for groups is 
done in various a priori redshift-slices, meaning only galaxies that 
are compatible with a given value of redshift are taken into account 
for cluster finding. This redshift-slicing constitutes the outer loop. 
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Figure 2. Flow chart for the extended friends-of-friends algorithm 
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As a result, there is a catalogue of structures for every redshift- 
slice. Every galaxy can only be a member of one structure of the 
catalogue belonging to a given z-slice. Yet it can also belong to 
other structures in the other 2;-slices. The third part of the algorithm 
is the unification of all structures that have at least one member in 
common. This guarantees that in the final catalogue every galaxy 
can only belong to one group or cluster. 

The underlying idea for the modification of the linking condi- 
tions is the following: The a priori redshift slices Zini approximate 
roughly the mean redshifts V of the original FOF-technique. So eq. 
Q changes into 

D,j ■-2sm^D{zin,) < Dl. (5) 
D (zini) is the distance to Zini in Mpc, being either 

(6) 

in the case of a low-redshift approximation, or more correctly 

D (Zini) ~ dA {Ho, flM,QA, Zini) (7) 

in the case of a cosmologically exact distance measurement. 
dA [Hoj^M ,^A, Zini) is the angular distance to Zini for a cos- 
mology with given Ho, Qm and Qa JCarroletalj ri992V Eq. OJ 
translates into two equations, one for each of the galaxies i and j: 



Vi ~ \vi — CZini\ ^ ~i (8) 

Vl 

:= \vj - czini \ < (9) 

If the individual redshift errors Szi and Szj of the galaxies i and j 
are taken into account, the left side of relations {Sj and j9) changes: 

V, < [(Vl/2)' + (cSz,)^] - Vl,,., (10) 
V, < [{Vl/2)' + (cfc,)^] =: Vl,,. (11) 



Obviously, in the case of very small redshift errors, eqs. (SJ and j9) 
are good approximations for eqs. <10t and <1 It . 

A flow chart for the EXT-FOF algorithm is shown in Fig.|2| 
First, the minimal redshift Zini ~ Zmin is chosen for structure find- 
ing. A catalogue of groups, that belong to this Zini-siice, is created 
as follows: An object i, that has not yet been assigned to any group 
belonging to this redshift-slice, is chosen from the catalogue. Un- 
like the original FOF, this galaxy also has to be compatible with 
the chosen Zini, meaning it has to fulfill eq. <10> (or eq. (S)). Then, 
friends j of that galaxy are searched for. They also have to be com- 
patible with the chosen Zini, i.e. they have to satisfy eq. <1U (or 
eq. j9}) and have to be closer to galaxy i than the cut-off distance 
Dl (eq. 0)- If no friends can be found, object i is moved to a 
list of isolated galaxies. If friends can be found, a group-candidate 
k (zini) is initiated and the galaxies i and j are added to it. The sur- 
roundings of the galaxies j are searched for companions fulfilling 
eqs. jsj and <I 1> (or j9j), and the loop is repeated until no further 
friends can be found. A group-candidate is called a real group, if 
the number A^fc(zi„i) of galaxies belonging to it satisfies the relation 

Nk(zi„^) > Nmin (12) 

and the group is then added to the catalogue of structures for the 
given redshift-slice. If this is not the case, the galaxy i and its 
friends are moved to the list of isolated objects. The next galaxy 
from the catalogue is then taken and its surroundings are searched 
for friends. If all galaxies are assigned either to a group or to the iso- 
lated list and no galaxies are left in the input catalogue, the search is 



continued in the next redshift-slice. For this, Zini is increased by a 
value Azini and the original input catalogue of galaxies is restored. 
The search for friends is then repeated as described above. Once the 
value of Zini has reached the limit z,nax, the search is stopped. So 
far, this technique yields {z^ax — Zmin) / Azini individual cata- 
logues of clusters. Since one galaxy can belong to different clusters 
in different redshift-slices, a unification of these groups is neces- 
sary to remove this ambiguity. To do this, a group I of any arbitrary 
Zini-slice is taken. A, so far empty, unified group m is created. All 
galaxies of the above mentioned group I are added to the unified 
group m and group I is removed from the group catalogue. Next, 
the unification procedure looks for a galaxy in unified group m that 
also belongs to another, non-unified, group n, irrespective of the 
redshift-slice that this group belongs to. If no such galaxy can be 
found, unified group m is complete, and the process is repeated for 
another group I. If there is a galaxy in m that belongs to a group 
n, as well, all galaxies of group n are added to the unified group m 
and n gets removed from the catalogue. This loop is repeated until 
unified group m is complete and none of its members are contained 
in any of the non-unified groups any more. The result is a catalogue 
of disjoint structures. 

Like Huchra & Geller's version, this new algorithm is com- 
mutative and yields reproducible results, i.e. the sequence of the 
input galaxies does not play a role. 

One of this technique's objectives is to reproduce group cata- 
logues found with the Huchra & Geller FOF in the case of spectro- 
scopic datasets. To do this, the values of Zini have to approximate 
every possible value of V for every galaxy pair. Theoretically, this 
can be reached by using a continuum of Zini-values, ranging from 
the minimum V of the two galaxies in the dataset that have the 
lowest redshifts to the maximum V of the galaxy pair with highest 
redshifts. Of course this is not feasible in practice. Instead, a dis- 
crete set of Zini values is used with a very small spacing Azini- 
The boundaries Zmin and Zmax are simply set to the minimum and 
maximum redshifts of the galaxies used for structure finding. 

There are two basic differences between the two FOF tech- 
niques: While the Huchra & Geller technique tests, whether eq. Q 
is fulfilled for a galaxy pair at only one redshift, the new algorithm 
tests, whether eq. j5} is satisfied for said galaxy pair at a multitude 
of redshifts. Furthermore, with the original algorithm, galaxies that 
are close to one another in projection can be linked together, even 
though not all of them are close to one another in redshift. Thus, 
chains of A*' > N,nin galaxies can be linked together into groups, 
that are very elongated in the redshift. In the case of the extended 
FOF, all A'^ > N,nin galaxies have to be compatible with a given 
redshift Zini in order to be called a group. This makes the above 
mentioned outcome of elongated galaxy chains highly unlikely and 
is the reason why the new algorithm is well-suited for cluster find- 
ing in photometric redshift surveys. The next Sections 01 111 ™d 
|5} show all of the possible effects resulting from these discrepan- 
cies and prove the validity of the extended friends-of-friends for 
photometric redshift datasets. 



3 APPLICATION TO THE CFAl REDSHIFT SURVEY 
3.1 The galaxy catalogue 

The CFA 1 redshift survey jHuchraet alJl983h is a magnitude lim- 
ited, spectroscopic galaxy survey, containing all t he gal axie s of the 
Zwicky jZwickv et al. 1968; Zwickv & Zwickvi |l97ll) or iNilsoi] 
('1973") catalogues, that satisfy the following selection criteria: 
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nipg < 14.5 mag , 
and 

fe>40°,(5>0°, or 6 < -30°, (5 > -2?5. 



(13) 



(14) 



The magnitudes are given in the B(0)-Zwicky system. The original 
catalogue covers 2401 galaxies. 

In this paper, an electronic version of the CFAl galaxy cata- 
logue, created in May 1997, is used. This new catalogue contains 
2396 galaxies with corrected magnitudes and redshifts. 

3.2 Creation of the group catalogues 

Following the recipe of iGeller & Huchral il983h . two group cata- 
logues are created, one by using the original FOP technique, the 
other by applying the ne w EXT-FOF algorithm. A summary of the 
lGeller&HuctirJ<1983h treatment of the galaxy data and choice of 
the linking parameters is given below: 

First, the heliocentric galaxy velocities are corrected for a 
dipole virgocentric flow 

Vv = Vin [sin Si sin 5v + cos Si cos Sv cos (a,; — av)] , (15) 

where Vin is the infall velocity, which is set to 300 km s^^. ay 
and Sv are the right ascension and declinati on of the Virgo cluste r 
(av = 12''28™7; Sv = 12°19!l: J1950: iRamella etal]|l997h . 
and Qi, and Si are the right ascension and declination of galaxy i. 
Furthermore, a correc tion for the solar m otion with respect to the 
local group is made IRamella et alll997h : 



Vg ~ 300 km s ^ sin k cos bi. 



(16) 



li and bi are the galactic longitude and latitude of galaxy i. All 
galaxies with corrected velocities less than 300 km s^^ are given 
an indicative velocity of 300 km s~^. This is done to avoid the sin- 
gularity at km s~^ . 

Only galaxies with velocities less than 12000 km s~^ are ac- 
cepted for the cluster search, while the mean velocity of the groups 
is limited to less than 8000 km s~^. 

A Hubble constant of Ho = 100 k m s~^ Mpc~^ is used, to b e 
in accordance with the approach used by'Geller & Huchra' 

The linking parameters are chosen to vary with redshift, in 
order to compensate for the variati on in the sampling of th e lu- 
minosity function, as d escribed in iHuchra & GelleJ jl982h . and 
lGeller&Huc'hr3<1983h : 



Dl = DoR, 
and 

Vl = VoR, 
with 

/•Mi. 

R 



$ (M) dM I I $ (M) dM 



$ (M) is the Schechter luminosity function with a 
M* = -19.40 mag, and $* = 0.0143 Mpc^^. 

Mii,ri = miim - 25 - 5 log (Vfid/Ho) , 

and 

Mij = niiim - 25 - 5 log (V/Ho) , 

in case of the original FOF technique. For cluster finding with the 
EXT-FOF algorithm, eq. <2H changes to 
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Figure 3. Variation of the linking criteria and Vl for the CFAl group 
catalogues, as a function of the FOF mean separation V, or the EXT-FOF 
redshift-slicing Zi^i 



■ miim - 25 - 5 log [czini/Ho) . 



(22) 



The fiducial velocity Vfid is set to Vfid ~ 1000 km s"^, the pro- 
jected separation Do and the velocity difference Vb at the fiducial 
velocity are chosen as Do ~ 0.52 Mpc and Vb = 600 km s"^. The 
resulting variation of the linking criteria and Vl as a function 
of V, or Zini respectively, is shown in Fig.|3| The described varia- 
tion of the linking parameters enables the algorithm to detect only 
structures that have a given minimal number overdensity relative to 
the mean, completely independent of the structur e's redshift. 

T he CFAl group catalogue published by iGeller & Huchral 
(Il98^ contained only groups with more than two members, so we 

set Nmin = 3. 

Since the CFAl is a local galaxy survey, the low-redshift ap- 
proximation, eq. J6l, is used for the distance calculation in the EXT- 
FOF. Because of the very small redshift errors of this spectroscopic 
survey, eqs. js} and j9j are utilised as the velocity linking criteria. 
The redshift-spacing is set to /S.Zini — 10"^, to make sure that 
every possible value of V can be approximated. 



3.3 Comparison of tlie group catalogues 

The catalogue of structures, resulting from the application of 
the original FOF algorithm, contains 176 groups and clusters, 
comprising a total of 1480 galaxies. The composition of these 
F OF structures i s sligh tly deviant from the ones published by 
lGeller& Huchral [1981 . This minor discrepancy is due to the dif- 
ferent input galaxy catalogues. 

Application of the EXT-FOF algorithm yields 165 groups, 
containing 1518 galaxies in total. Thus, in the case of the CFAl, 
the EXT-FOF shows a tendency to identify somewhat larger struc- 
tures. 

In order to compare the algorithms, we do an object-to-object 
comparison of the group members. We define eight categories for 
the level of agreement in the group composition: 
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Table 1. Comparison of the CFAl FOF and EXT-FOF structures 





category 


FOF 


EXT-FOF 


%FOF 


% EXT-FOF 


total 




176 


165 


100 


100 


identical 


1 


122 


122 


69.3 


73.9 


only FOF struct. 


2 


4 




2.3 




only EXT-FOF struct. 


3 




8 




4.8 


FOF struct, larger than EXT-FOF 


4 


5 


5 


2.8 


3.0 


EXT-FOF struct, larger than FOF 


5 


12 


12 


6.8 


7.3 


FOF struct, is combination of EXT-FOF structs. 


6 








0.0 


0.0 


EXT-FOF struct, is combination of FOF structs. 


7 


28 


13 


15.9 


7.9 


FOF and EXT-FOF structs. have some elements in common 


8 


5 


5 


2.8 


3.0 


FOF structs. found with EXT-FOF alg. 




172 




97.7 





(1) The FOF and EXT-FOF group have identical composition 

(2) The group is only found with FOF, i.e. no galaxy in the group 
is a member of an EXT-FOF group 

(3) The group is only found with EXT-FOF, i.e. no galaxy in the 
group is a member of a FOF group 

(4) The EXT-FOF group is a subset of the FOF group, i.e. the 
FOF group has more members than the EXT-FOF, and none of 
those surplus members belong to any other EXT-FOF group 

(5) The FOF group is a subset of the EXT-FOF group, i.e. the 
EXT-FOF group has more members than the FOF, and none of 
those surplus members belong to any other FOF group 

(6) The FOF group is a combination of multiple EXT-FOF 
groups, and can also contain further galaxies that are not part of 
any other EXT-FOF group 

(7) The EXT-FOF group is a combination of multiple FOF 
groups, and can also contain further galaxies that are not part of 
any other FOF group 

(8) The FOF and EXT-FOF group have some members in com- 
mon, but do not fall under any of the above mentioned criteria 

This classification scheme facilitates the examination of the intrin- 
sic characteristics of the two friends-of-friends algorithms, as can 
be seen in the next Section l7!4l 

Table Q shows the statistics of the comparison between the 
FOF and EXT-FOF structures. The first column provides a short 
description of the categories used. In the second column, the cor- 
responding categories are listed for easy reference. The third and 
fourth column contain the number of FOF and EXT-FOF groups 
respectively, that fall under the specified category, while the fifth 
and sixth column show the percentage of these groups. 

122 groups or clusters are recovered identically (category 1) 
by both algorithms, corresponding roughly to 72% of the FOF or 
EXT-FOF structures. 17 of the FOF groups are found by the EXT- 
FOF algorithm either as slightly enlarged (category 5), or reduced 
(category 4) structures. 28 FOF structures are recovered by the 
EXT-FOF as a combination of multiple FOF groups (category 7). 
Further five FOF structures are found by the EXT-FOF, suffering 
from combinations of the categories 4, 5, 6, and 7 (i.e. category 8). 
Only four of the 176 FOF groups are not found with the EXT-FOF 
algorithm (category 2). Thus, a total of 172 FOF structures are re- 
covered by the EXT-FOF algorithm, corresponding to a recovery 
rate for the EXT-FOF algorithm of almost 98%. On the other hand, 
the EXT-FOF technique finds eight additional groups (category 3), 
that are not part of the FOF group catalogue, leading to a spurious 
detection rate for the EXT-FOF technique of less than 5%, provided 



that the FOF technique delivers a complete structure catalogue. A 
closer look at the category 2 groups shows, that they are extremely 
small. Three of them have only three members and one of them 
contains four members. Furthermore, they are very elongated in 
the redshift direction, an attribute that might speak against them as 
possible galaxy groups, anyway. The category 3 EXT-FOF groups 
are also very small. Four of them have three members, the other 
four contain four members. All things considered, in the case of 
the CFAl group determination, the EXT-FOF algorithm yields re- 
sults very similar to the FOF technique. A slight tendency to find 
larger groups with the EXT-FOF can be seen here. The cause for 
the discrepancies between the two friends-of-friends techniques is 
explained in Section l7!4l 

3.4 Analysis of the discrepancies 

The deviations between the two group catalogues can be ascribed 
to two effects: Either the EXT-FOF algorithm is able to link two 
galaxies together, that the original FOF cannot; Or the EXT-FOF 
algorithm can not link Nmin (here: Nmin ~ 3) objects together 
within one redshift-slice. 

If the EXT-FOF algorithm finds a link between two galaxies, 
yet the FOF does not, then one of the following deviations can re- 
sult: 

(1) The group can only be found with EXT-FOF (i.e. category 

3) 

(2) The group found with EXT-FOF contains more members 
than the corresponding FOF group (i.e. category 5) 

(3) The EXT-FOF group is a combination of multiple FOF 
groups (i.e. category 7) 

In the first case, the original FOF might find two linked galax- 
ies, but cannot find a third object, that fulfills the linking crite- 
ria. Thus the number of objects within the original FOF group- 
candidate is too small, i.e. eq. ^4) is violated, and the group is re- 
jected. If the EXT-FOF method is able to find a third object in the 
current redshift-slice, that is linked to one of the galaxies in the pair, 
the number of group members satisfies eq. <I2t and the group is ac- 
cepted. It is also possible, that the original FOF finds two galaxy 
pairs, but is not capable of connecting them, while the EXT-FOF 
algorithm is able to link two galaxies of each pair, thus creating a 
group-candidate containing four galaxies. Eq. <12t is then satisfied 
and the EXT-FOF group is accepted. 

In the second case, the EXT-FOF is able to find one or more 
galaxies, that are defined as isolated by the FOF algorithm, linked 
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to group members, resulting in a larger group in the EXT-FOF cat- 
alogue. 

In the third case, the original FOF finds two or more separate 
groups. If the EXT-FOF can find links between members of the 
different FOF groups, those groups are combined into one big EXT- 
FOF structure. Of course, it is also possible, that further galaxies, 
defined as isolated by the original FOF, get attached to such a big 
structure, following the reasoning of the second case. This leads 
to EXT-FOF groups, that consist of a number of FOF groups and 
some additional galaxies. 

If the EXT-FOF algorithm cannot find Nmin galaxies linked 
together within at least one of the redshift-slices, one of the follow- 
ing deviations can result: 

(1) The group can only be found with the original FOF (i.e. cat- 
egory 2) 

(2) The group found with FOF contains more members than the 
corresponding EXT-FOF group (i.e. category 4) 

(3) The FOF group is a combination of multiple EXT-FOF 
groups (i.e. category 6) 

In the first case, the original FOF finds at least Nmin = 3 
objects that are linked together, satisfying eq. J4}. The EXT-FOF, 
on the other hand, can only find pairs of galaxies in various redshift 
slices. Since every group-candidate has to fulfill eq. <12> in at least 
one redshift slice, these galaxy pairs are not taken into account in 
the unification process, and these galaxies are defined as isolated 
by the EXT-FOF algorithm. 

In the second case, one galaxy, defined as group member by 
the FOF, cannot be attached to an EXT-FOF group. This is due 
to the fact, that there is no redshift slice, where this galaxy can 
be linked to at least two (in general: Nmin ~ 1) members of this 
EXT-FOF structure. Thus the object is not entered into the pre- 
unification catalogue and is not included in the EXT-FOF group. 
It is also possible, that more than one galaxy of the FOF group is 
missing in the corresponding EXT-FOF group. This can happen, 
if the above mentioned situation is true for all of those galaxies 
and, additionally, there is no redshift slice where two or more of 
those missing galaxies are linked with one of the EXT-FOF group 
members. However, in the case of the CFAl group catalogues, none 
of the five FOF groups, that fall under category 4, are larger than 
their EXT-FOF counterpart by more than one group member. 

In the third case, the EXT-FOF finds two or more separate 
groups, while the FOF is able to link them together into one struc- 
ture. This can happen, if there are no redshift slices, where at least 
two members of one EXT-FOF group can be linked with at least one 
member of another group. Following the reasoning of the second 
case, it is furthermore possible, that the large FOF group can con- 
tain additional members, that are defined as isolated by the EXT- 
FOF. As far as the CFAl catalogues are concerned, this effect never 
shows up, though. 

Category 8 groups are generated by a combination of the two 
effects: Members get lost in the EXT-FOF groups, because of too 
few linked objects at a given redshift slice, and at the same time new 
links can be found to galaxies, that are defined as isolated by the 
original FOF, while some subset of the FOF and EXT-FOF group 
is identical. 

In one of the cases examined for category 3 of the CFAl com- 
parison, a group found with the original FOF, is removed from the 
group list because its mean velocity exceeds the limit. The EXT- 
FOF is able to find another galaxy linked to this group, pushing the 
group mean velocity below the limit. Thus, the group is included in 
the EXT-FOF, yet not in the FOF group catalogue. However, this is 
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Figure 4. Comparison of the CFAl projected linking distance (solid 
line) and the projected separation Dij (dashed line) of the galaxies i and 
J, as a function of the the mean distance V , or the redshift slice Zi^i, re- 
spectively. The angular separation 9ij of the chosen galaxy pair leads to 
two intersections between Dj^ and Dij, dividing the graph into three areas 
(dotted lines). The projected Hnking criterion Dij < Dj^ is satisfied in the 
areas I and III. It is not satisfied in area II. 



a border effect and could in principle also work in the other direc- 
tion, removing groups from the EXT-FOF catalogue (i.e. category 
2). 

The reason, why the EXT-FOF algorithm is sometimes able 
to find links between objects, while the original FOF is not, can be 
explained as follows: 

The equations used here for the calculation of the projected 
distance Dij, eqs. Q, js}, and ^6), show that Dij oc V and 
Dij (X Zini, respectively. However, being varied with the lumi- 
nosity function, the linking distance is not proportional to the 
redshift. Dl is growing monotonically with z, switching from a 
convex curvature at low redshifts, to concave at higher redshifts. 
The slope of Dij is set by the angular separation 9ij of the two 
galaxies. There are three possible scenarios for the behaviour of 
Dij and Dl'- 

Either Oij is so small, that Dij < Dl over the entire red- 
shift range of F G [300kms~\l2000kms"^], or z,„i G 
[0.001, 0.04] respectively, i.e. the projected linking criterion, eq. 
Q or is fulfilled at V and in all other redshift slices Zini- 

Or 9ij is so large, that Dij > Dl over the entire redshift range. 
This galaxy pair can neither be linked together by the FOF, nor by 
the EXT-FOF algorithm. 

Or 9ij lies between the above mentioned values. In this case, Dij 
intersects Dl twice. 

The third scenario can lead to the links, that can only be found with 
the EXT-FOF algorithm. In this case, the redshift range can be di- 
vided into three areas, which is shown in Fig.|4| The figure shows 
a comparison of the projected linking distance Dl (solid line) and 
the projected separation Dij (dashed line) of the galaxies i and j. 
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Figure 5. Comparison of the CFAl projected linking distance Dj^ (solid line) and projected separation Dij (dashed line), as a function of the the mean 
distance V, or the redshift slice Zini, for two example galaxy pairs. These schematic drawings show magnified excerpts of the transition regions (dotted lines) 
between the areas I and II (left panel), and the areas II and III (right panel), that are described in Fig.|3 Left panel: The projected separation Dij at the mean 
redshift V (denoted by the solid vertical line in area II) of the galaxies i and j, does not fulfill the projected linking criterion Dij < D^. Thus, the original 
FOF cannot link the two galaxies. In the case shown here, the range of Zini values, satisfying the EXT-FOF redshift-space linking criteria, eqs. jsj and l9l . 
reaches into area I. For simplification, only one of those redshift slices Zi„i and the corresponding value of Dij is shown here (solid vertical line in area I). 
Dij < Di^ is fulfilled for this redshift. and the galaxies i and j are found as linked by the EXT-FOF algorithm. Right panel: V is again lying in area II, thus 
the original FOF cannot link this galaxy pair. The range of Zi„i values, that satisfy the EXT-FOF redshift-space linking criteria, reach into area III in this case. 
Once more, only one of those redshift slices Zi„i and the coiresponding value of Dij is shown here (solid vertical line in area III). Dij < Dj^ is fulfilled for 
this redshift, and the galaxy pair i and j are found as linked by the EXT-FOF algorithm. 



as a function of the the mean distance V, or the redshift slice Zini, 
respectively. In the areas I and III, the projected linking criterion 
Dij < Dl is fulfilled. Whereas in area II it is never fulfilled. It 
becomes obvious, that it is critical in which of those areas the link- 
ing criteria are tested. The original FOF can only test the projected 
linking criterion at a fixed redshift V, set by the galaxy pair. The 
EXT-FOF, on the other hand, tests the linking criteria at all red- 
shifts. If the EXT-FOF redshift-space linking criteria, eqs. Js} and 
^9}, can be fulfilled for a given galaxy pair, then they are generally 
satisfied for an interval of Zini values. If eq. js} can also be sat- 
isfied for at least one of those redshifts Zi„i, then all criteria are 
fulfilled and the galaxy pair is found as linked by the EXT-FOF. 
Thus, only the EXT-FOF and not the original FOF is able to link a 
galaxy pair in one of the two following situations: The mean veloc- 
ity V is lying in the area II, and the redshift interval of Zini values, 
that satisfy eqs. ^8} and j9j, reaches either into area I (Fig.|5l left 
panel), or into area III (Fig.|5| right panel). As a result, the pro- 
jected linking criterion is not satisfied for the original FOF, while 
both linking criteria can be fulfilled in at least one Zi„i slice, in the 
case of the EXT-FOF 

It can be easily seen, that this effect does not exist, if Dl and 
Dij follow the same variation with redshift, Dl (z) oc Dij (z). 

In the following, we explain why the EXT-FOF algorithm 
sometimes can not link together Nmin = 3 objects within at least 
one Zini slice, and why this is the reason for some of the above 
mentioned deviations from the original FOF. 

The original FOF algorithm can move relatively freely along 
the redshift axis, when looking for friends. For example, we con- 
sider a set of galaxies a, b, and c, with given velocities Va, Vb, and 



Vc- For matters of simplicity, we assume Va < vt < Vc- The FOF 
calls those three galaxies a group, if there is at least one galaxy 
among them, that can be linked to the other two. If, for example, 
the galaxies a and 6, and b and c fulfill the linking criteria, i.e. eqs. 
Q and J3j, then a, b and c are linked together. It is not neces- 
sary, that a and c also satisfy the linking criteria, and Vac > Vl 
is possible. Thus, in principle, the FOF allows for very elongated 
chains of galaxies along the redshift axis, as long as these galax- 
ies have a small projected separation and the velocity difference 
Vij between next neighbours satisfies eq. ^3). This effect is what 
makes the original FOF unqualified for cluster finding in datasets 
with rather uncertain redshifts, like photometric galaxy catalogues. 

The EXT-FOF algorithm cannot move so freely along the red- 
shift axis. By only looking for friends among galaxies, that are 
compatible with a given redshift slice, the probability of finding 
very elongated structures is efficiently reduced. 

To illustrate the problem, Fig. |6| shows the case of a set of 
galaxies, a, b, and c, that are found as a group by the FOF, yet 
not by the EXT-FOF. For simplification, we choose a galaxy set, 
that has a very small spread in the projected distance, so that eqs. 
O and Q are fulfilled for all three galaxies, and we only have 
to test, whether the velocity linking criteria are fulfilled. The fig- 
ure shows the variation of the linking velocity Vl (solid line) with 
redshift Zini, or mean velocity V , respectively. The individual ve- 
locities Va,Vh, and Vc of the three galaxies are shown on the lower 
scale. The filled squares are the original FOF velocity separations 
Vab, Vbc, and Vac, resulting from eq. and plotted at the cor- 
responding mean velocities. It becomes obvious, that Vab and Vbc 
satisfy the velocity linking criterion, and together with the already 
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Figure 6. Comparison of the CFAl linking velocity Vi (solid line) with the 
original FOF (filled squares) and EXT-FOF (dashed lines) velocity separa- 
tions, Vij and Vi, for a set of three galaxies, a, 6, and c, that can only be 
linked with the original FOF, yet not with the EXT-FOF. The lower scale 
shows the redshift Zi^i, the upper scale shows the coiTesponding mean ve- 
locity V . The individual velocities Va, v\y, and Vc of the three galaxies are 
also shown on the lower scale. The original FOF velocity separations Vah^ 
V^c. and Vac, resulting from eq. jsj, are plotted at the corresponding mean 
velocities. The EXT-FOF velocity separations are plotted as 2 Va, 2 Vj,, and 
2 Vc to make an easy comparison with Vj, possible. The dotted lines show 
the areas, where eqs. |8} and |9j are satisfied for two galaxies: In area I, the 
requirement is fulfilled for the galaxies a and h, and in area II it is fulfilled 
for the galaxies h and c. In no redshift shce, can all Nmin = 3 galaxies 
satisfy the EXT-FOF velocity linking requirement Vi < Vj^ /2. 



fulfilled projected linking criterion, the original FOF combines the 
three galaxies into a group. The dashed lines denote the EXT-FOF 
velocity separations. They are plotted as 2Va, 2 Vb, and 2Vc to 
make an easy comparison with Vl possible. The dotted lines show 
the areas where eqs. js} and ^9) are satisfied for two galaxies: In 
area I, the requirement is fulfilled for the galaxies a and b, and in 
area II it is fulfilled for the galaxies 6 and c. There is no redshift 
slice, where all three galaxies satisfy the EXT-FOF velocity link- 
ing requirement Vi < Vl /2. As a result, the EXT-FOF can only 
find the galaxies a and b, or b and c linked together, in the Zini 
ranges given by the areas I, or II. Those galaxy pairs do not satisfy 
eq. <12> with Nmin ~ 3, and thus are not added to the group list. 

Theoretically, another effect, that could remove a single link 
between two galaxies, is conceivable. It would also lead to the ob- 
served phenomena of category 2, 4, 6, or 8. If the step size Azini of 
the EXT-FOF redshift slicing is chosen too large, not every possible 
value of V can be approximated anymore, and a link found with the 
original FOF could not be recovered with the EXT-FOF algorithm. 
This is never the case in one of the examined catalogues, though, 
proving that the chosen step size is sufficiently small and does not 
lead to unwanted numerical effects. 



4 APPLICATION TO THE LCRS REDSHIFT SURVEY 
4.1 The galaxy catalogue 

The LCRS jShectman et'Zlll99d) is an i?-band selected, spectro- 
scopic galaxy survey. It consists of six 1?5 x 80° strips, three of 
which are lying in the north, the other three in the south Galac- 
tic cap. The survey covers a total area of over 700 deg^, divided 
into 327 individual fields. For 120 of those fields spectroscopy is 
done with a 50 fibre multiobject spectrograph (MOS), and they 
have nominal apparent magnitude limits of 

16.0 mag < niR < 17.3 mag. (23) 

For the remaining 207 fields spectroscopy is done with a 1 12 fibre 
MOS, with nominal apparent magnitude limits of 

15.0 mag <mR < 17.7 mag. (24) 

For every setup, all of the fibres are used, but since each of the 
fields is observed only once, there is a field-to-field variation in the 
selection criteria. The protective tubing around the individual fibres 
of the MOS makes it impossible to observe spectra of objects, that 
are closer to each other than 55", leading to 55" "orphans", that 
have no spectroscopic redshifts. The surveyed galaxies are sampled 
randomly within each field. 

The catalogue we use here is a combination of the 23695 
galaxies that have spectroscopic redshifts and the 1694 55" "or- 
phans", resulting in a total of 25389 galaxies. Both datasets only 
include objects, that are lying within the geometric and photomet- 
ric boundaries of the survey. Artificial redshifts are assigned to the 
55" "orphans", by giving each of them the redshift of its nearest 
neighbour, convolved with a Gaussian of width a = 200 km s~^ 
(Tucker et al. 2 00Q. Tucker, priv. comm.). 



4.2 Creation of the group catalogues 

Following the recipe of iTucker et alj i200(]t) . we once more cre- 
ate two structure catalogues, one with the original FOF algorithm, 
the other wit h our EXT-FOF. In the following, an overview of the 
iTucker et a l. 1 2000) treatment of the data is shown. Furthermore, 
the application of this recipe to the EXT-FOF is explained. 

First, all galaxy velocities are corrected for motion relative 
to the dipole moment of the cosmic microwave background. The 
equation used by .Tucker et al. 12000.) is given in galactic coordi- 
nates 

Vv ~ Vq [sin bi sin 6q + cos bi cos 6q cos {h — Iq)] , (25) 

where Vp, = 368.9 km s~\ 6© = 48?05, and Iq = 264?33 
jLineweaver et alJI 19961 Tucker, priv. comm.), and bi, k are the 
galactic latitude and longitude of galaxy i. 

The set of galaxies used for cluster finding is limited to objects 
having corrected velocities Vi such that 

7500 km s"^ < Vi < 50000 km s~\ (26) 
and absolute magnitudes Mi between 

- 22.5 mag + 5 log h < hU < -17.5 mag + 5 log h. (27) 
Not taking any colour corrections into account, we simply set 

= mi - 51og(dL {Ho,nM,i^A,v^/c)) + 5, (28) 

where m.i is the apparent magnitude of galaxy i, and 
dL {Ho,^M ,^iA,Vi/c) is the luminosity distance for the given 
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cosmology to galaxy i jCarroletalll99A . Since the LCRS is rel- 
atively deep, going out to redshifts of roughly 0.2, it becomes nec- 
essary to use cosmologically correct expressions. 
Only groups with mean velocities Vg such that 



10000 km s^ 



<Vg < 45000 km s' 



(29) 



are accepted in the final group catalogue. 

We use a flat cosmology with Ho — 65 km s~^ Mpc~^, 
nM =0.3, and Ha = 0.7. 

The distance equations for the original FOF have to be mod- 
ified to take the cosmologically correct expressions into account. 
Eq. Q becomes 



Dij =2 sin -^Dave < Dl , 



(30) 



with the mean comoving angular distance Dave between the galax- 
ies i and j 



Dave 



dA {Ho, Qm, ^a, f ) + dA {Ho, nM, ^A, ^) 



(31) 



replacing the V of eq. J2}. 

In the case of the EXT-FOF, we use D (zini) of eq. for the 
calculation of the projected distance Dij . The individual spectro- 
scopic redshift errors of the galaxies are not yet taken into account, 
thus eqs. jsj and j9j still apply. A stepsize of Azint ~ 10^^ is 
used. 

The linking parameters are varied with redshift. Yet, due to 
the field-to-field variations in the sampling fraction, the photomet- 
ric limits, and the number of MOS fibres, this dependency is more 
complicated than in the case of the CFAl. Eqs. (17} and (TsJ are 
used for the variation of Dl and Vl, and R is given by 



R 



1 -1/3 



"fid 



(32) 



^exp J- j-y\^ jjjg expected galaxy number density in the field / at 
a comoving distance D. D is either Dave in the case of the original 
FOF, or D (zini) in the case of the EXT-FOF. n^i^ is the galaxy 
number density n*^^^ (/, D) in a fiducial field at a given fiducial 
distance, n'^^*' (/, D) is calculated with the help of the Schechter 
luminosity function $ (M): 



:f,D) = F 



$ (M) dM, 



(33) 



where F is the sampling fraction of field /. Mmin and AI,nax are 
the absolute photometric limits of this field, at the given comoving 
distance D. When comparing two galaxies from two different fields 
fa, and ft, the average of the expected galaxy densities is taken: 



•if,D) 



n-'' {fa,D) + n^'-^{f,,D) 



(34) 



Two Schechter luminosity functions are used /Lin et alj [1995 
[jucker et al. 1997), following the approach of Tucker et alJ il997l) . 
One is valid for the 50 fibre fields in the southern Galactic cap, 
and has a = -0.74, M* = -20.55 mag + 5 log h, and $* = 
0.016 /i^Mpc"^. The other is used for the 50 fibre fields in the 
northern Galactic cap, as well as for all 112 fibre fields. Its pa- 
rameters are a = —0.70, M* = —20.29 mag + 51og/i, and 
$* = 0.019 /i^Mpc^^ 

For the fiducial field the latter luminosity function is used. Fur- 
thermore, this field has a sampling fraction of F = 1, and apparent 
magnitude limits of 
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Figure 7. Variation of the linking criteria Dj^ and Vt for the LCRS group 
catalogues, as a function of redshift z. As a reminder, in this case R is not 
only a function of distance, but also depends on the field. We show a typical 
example field, having a MOS setup with 112 fibres, a sampling fraction 
of F = 0.7. and photometric limits of mmin = 15.0 mag, rrimax = 
17.7 mag. 



15.0 mag < mu < 17.7 mag. 



(35) 

The fiducial redshift is set to c 2 fid = 30000 kin s^ ^ . 

Only groups having at least Nmin = 3 members are searched 
for The projected separation Do and the velocity difference Vo are 
chosen as Dq — 0.7 15/i~^Mpc and Vp = 500kms~^, respec- 



tively, again followine lTucker et alJil997l) . 

The resulting variation of the linking criteria Dl and Vl as a 
function of redshift z is shown in Fig.|7l This graphic is created for 
an example field having a MOS setup with 112 fibres and photo- 
metric limits rrimin ~ 15.0 mag, rrimax ~ 17.7 mag. We use a 
typical sampling fraction of F — 0.7. 

4.3 Comparison of the group catalogues 

Application of the original FOF yields a total of 1367 groups. 6747 
galaxies are contained in these gr oups. The stru ctures are slightly 
deviant from the ones published bv lTucker etaP boOO.) . These dis- 
crepancies in the group composition arise from three differences in 
the treatment of the data: First, we include no colour corrections in 
eq. <28> , unlike Tucker et al. 1 2000). Second, for the determination 
of Dave and D (zirn) we make u se of the com oving angular dis- 
tance dA {Ho, ^M, ^A, z), while Irilcker et al] f200Qi') utilised the 
proper motion distance du {Ho, f^M, Ha, z). Third, we determine 
group memberships in a flat universe with a low matter content, 
whereas Tucker et al. 1 2000) used an Einstein-de Sitter universe. 

The catalogue resulting from the EXT-FOF application con- 
tains 1285 groups, with a total of 6337 galaxies. 

Table |2| shows the statistics of the comparison between the 
FOF and EXT-FOF structures in the case of the LCRS. The same 
object-to-object comparison is used as described in Section <3.3> . 

1280 of the FOF groups are recovered with the EXT-FOF al- 
gorithm, corresponding to a recovery rate of almost 94%. The EXT- 
FOF, on the other hand, finds only one group, that is not a member 
of the FOF group catalogue (category 3), leading to a spurious de- 
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Table 2. Comparison of the LCRS FOF and EXT-FOF structures 





category 


FOF 


EXT-FOF 


%FOF 


% EXT-FOF 


total 




1367 


1285 


100 


100 


identical 


1 


1145 


1145 


83.8 


89.1 


only FOF struct. 


2 


87 




6.4 




only EXT-FOF struct. 


3 




1 




0.1 


FOF struct, larger than EXT-FOF 


4 


120 


120 


8.8 


9.3 


EXT-FOF struct, larger than FOF 


5 


4 


4 


0.3 


0.3 


FOF struct, is combination of EXT-FOF structs. 


6 


5 


10 


0.4 


0.8 


EXT-FOF struct, is combination of FOF structs. 


7 


2 


1 


0.2 


0.1 


FOF and EXT-FOF structs. have some elements in common 


8 


4 


4 


0.3 


0.3 


FOF structs. found with EXT-FOF alg. 




1280 




93.6 





tection rate of less than 1%. Both the recovery and spurious detec- 
tion rates are based on the assumption that the original FOF algo- 
rithm used here yields a complete structure catalogue. 1 145 struc- 
tures are found identical by both algorithms, so the rate of identical 
recoveries lies at roughly 86%. Of the 87 FOF groups, that have no 
EXT-FOF counterparts (category 2), 77 have only three members, 
and the remaining ten contain four objects each. Those groups are 
all very elongated in redshift, making their group status question- 
able, anyway. The one group falling under category 3 is also ex- 
tremely small, having only three members. The above comparison 
shows, that both algorithms yield very similar results. 



4.4 Analysis of the discrepancies 

Like in the case of the CFAl, all of the deviations mentioned above 
can be explained by the two effects, that are described in Section 
1141 

The reason for finding an additional link with the EXT-FOF 
in the LCRS catalogue is very similar to the case of the CFAl. 
The equations used for the determination of the projected dis- 
tance Dij, eqs. <30> . <31> . and Js}, 0, show that Dij grows with 
dA (Ho, flM, i^A, z), while the linking distance Dl is varied with 
the luminosity function. Dij has a convex curvature, is con- 
cave. The slope of Dij is once more set by the angular separation 
9ij, and there are again three possible scenarios for the behaviour 
of Dij and Dl- Very small angular separations lead to Dij < Dl, 
and very large values of Oij lead to Dij > Dl for the entire redshift 
range, that is used for cluster finding. Fig.|8|shows the case of an ex- 
ample galaxy pair with a medium sized angular separation, leading 
to one intersection between Dij (dashed line) and Dl (solid line). 
The intersection divides the graph into two areas (dotted line). In 
area I the projected linking criterion Dij < Dl is satisfied, in area 
n it is not. Thus, whenever the original FOF is limited to compar- 
ing Dij with Dl at a value of Dave, that corresponds to a redshift 
in area IL and the redshift-space linking criteria, eqs. js) and j9j, 
allow the EXT-FOF to compare Dij with Dl in area L only the 
EXT-FOF can find a link between the galaxies i and j. 

The explanation, why the EXT-FOF cannot link together cer- 
tain objects in the LCRS catalogue, while the original FOF can, is 
completely analogous to the case of the CFAl: The original FOF 
can move relatively freely through redshift space in order to link 
galaxies to one another. The EXT-FOF, on the other hand, is always 
limited to fulfilling both linking criteria for at least Nmin objects 
within one redshift slice. 
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Figure 8. Comparison of the LCRS projected linking distance D l (solid 
line) and the projected separation Dij (dashed line) of the galaxies i and 
j, as a function of the redshift z. The angular separation 6ij of the chosen 
galaxy pair leads to one intersection between Dl and Dij, dividing the 
graph into two areas (dotted line). The projected linking criterion Dij < 
Dl is satisfied in area 1, yet not in area II. This example galaxy pair is lying 
in a field, with a MOS setup of 1 12 fibres, a sampling fraction of F = 0.7, 
and photomefi'ic limits oirrimin = 15.0 mag, rrimax = 17.7 mag. 



5 LCRS WITH SIMULATED PHOTOMETRIC 
REDSHIFTS 

In order to prove the validity of the EXT-FOF algorithm for struc- 
ture finding in photometric redshift datasets, we create artificial 
photometric redshifts for all galaxies contained in our LCRS galaxy 
catalogue. We determine a group catalogue, based on this new 
dataset, with the help of the EXT-FOF algorithm, taking into ac- 
count the photometric redshifts and their errors. A comparison 
between this structure catalogue and the EXT-FOF catalogue de- 
scribed in Section l4!2l shows the applicability of the EXT-FOF tech- 
nique in case of galaxy catalogues with photometric redshifts. 
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5.1 Creation of the galaxy dataset with simulated 
photometric redshifts 

For every LCRS galaxy a random redshift offset is created. The 
offsets follow a Gaussian distribution with a cr of roughly 5% of 
the survey depth. That percentage resembles the typical proportions 
between the redshift errors and the depth of a photometric redshift 
survey. In the case of the LCRS, the width of the Gaussian is set 
to ca = 2500 km s~^. The photometric redshifts are then created 
by adding the individual offsets to the spectroscopic redshifts of 
the galaxies. Furthermore, the redshift errors Szi are all set to a. 
Except for the redshifts of the galaxies and their errors, the galaxy 
dataset remains unchanged. 

5.2 Creation of the group catalogue 

To construct the EXT-FOF structure catalogue of this pseudo- 
photometric redshift dataset, we follow the recipe described in Sec- 
tion]^ 

The new galaxy dataset is given the same treatment as the 
spectroscopic one, as far as the velocity correction, and the culling 
of galaxies on the basis of the velocity and absolute magnitude lim- 
its is concerned. Due to this selection process, the subset of galaxies 
that enters the structure finding procedure can be deviant from the 
one used in the spectroscopic redshift case. A total of 22739 galax- 
ies are used for cluster finding in the spectroscopic redshift case, 
22617 galaxies are used in the photometric redshift case. 20255 
galaxies are common to both culled datasets. 

For the calculation of the projected distance Dij, we use 
D (zini) of eq. 0. The relatively large redshift errors are taken 
into account, and relations <10t and <1H apply for the linking ve- 
locities VL,i and Vlj- The stepsize Azi„i is again set to 10^''. 

The same low matter density cosmology is used as described 
in Section 1431 We furthermore use the same linking parameters 
and group mean velocity limits. The minimum number of objects 
is once more set to Nmin = 3. 

5.3 Comparison of the group catalogues 

Section l431 shows that in case of the spectroscopic dataset the EXT- 
FOF algorithm yields a group catalogue, that is very similar to the 
EOF group catalogue. Thus, in principle, the groups resulting from 
the photometric redshift dataset could be compared to either the 
EOF, or the EXT-FOF group catalogue of the spectroscopic dataset. 
For the following examination we choose the latter. 

The EXT-FOF algorithm finds a total of 1598 structures in 
the simulated photometric redshift galaxy catalogue. The groups 
and clusters contain 10831 galaxies. This corresponds to a mean 
of almost seven objects per structure. In the spectroscopic redshift 
case, structures contain a mean of almost five objects. Obviously, 
the EXT-FOF algorithm tends to find more and larger structures in 
photometric redshift datasets. 

The comparison between the EXT-FOF structures resulting 
from the spectroscopic redshift dataset and the structures resulting 
from the photometric one is shown in table |3| Again, we use the 
"category 1 - 8" classification scheme, that we already described 
in Sect. 13.31 replacing "EOF" with "EXT-FOF with spectroscopic 
redshift dataset" and "EXT-FOF" with "EXT-FOF with photomet- 
ric redshift dataset". 

Only 94 of the 1285 EXT-FOF structures with spectroscopic 
redshifts cannot be retrieved (category 2), leading to a recovery rate 
of almost 93%. 646 additional groups are found in the photometric 



galaxy dataset (category 3), corresponding to a spurious detection 
rate of 40%. The recovery and spurious detection rates are based on 
the assumption that the application of the EXT-FOF algorithm to 
the spectroscopic dataset results in a complete structure catalogue. 
The similarity between the original EOF groups and the EXT-FOF 
groups in the case of the spectroscopic galaxy dataset was already 
confirmed in Sect. 14.31 The number of structures, that are identical 
in both group catalogues is roughly 16%. Due to the strongly differ- 
ing input galaxy catalogues, this small percentage does not come as 
a surprise. Both the category 2 and 3 structures tend to be relatively 
small. 96.8% of the category 2 and 81.6% of the category 3 groups 
have less than five members. 

Under the assumption that the structures found by the EOF 
and EXT-FOF algorithms in the spectroscopic galaxy dataset are 
real, this comparison shows that the EXT-FOF algorithm is capable 
of finding almost all of the structures contained in this photometric 
redshift dataset. Roughly 60% of the found structures can be ex- 
pected to be real. This demonstrates that the EXT-FOF technique is 
a very conservative cluster finder and can be used for identification 
of cluster candidates in photometric redshift galaxy datasets. 



5.4 Analysis of the discrepancies 

Some of the discrepancies in the two group catalogues are due 
to the slightly different composition of the culled input galaxy 
datasets. After all, the composition of those datasets differs by 
roughly 10%. However, since this is only a border effect, we do 
not wish to give it further attention here. 

The really interesting discrepancies can be ascribed to either 
one, or both, of the following two reasons: 

One is the scattering of the newly created photometric redshifts 
against the real, spectroscopic ones. 

The other is the fact that the redshift errors are finally taken into 
account when testing the velocity linking criteria, using relations 
fTol and (ID- 
Due to the Gaussian nature of the simulated photometric red- 
shift distribution, almost 32% of the galaxies have new redshifts, 
that deviate by more than la from their original spectroscopic ones. 
However, eqs. jlOt and <11> only allow for a deviation from Zirn of 
roughly la (Vl /2 is relatively small compared to cSzi — cSzj = 
ca). So there is a non-negligible amount of galaxies in the photo- 
metric redshift dataset, that are found as linked in the spectroscopic 
redshift cluster catalogue, yet cannot be linked together anymore, 
because of their large scatter in photometric redshift. On the other 
hand, this scatter can also link galaxies together, that are defined as 
separated in the spectroscopic redshift dataset. So the scattering is 
responsible for both, finding and losing links between galaxies, and 
can result in any type of deviation ranging from category 2 to 8. 

Taking the redshift errors into account leads to much larger 
velocity linking criteria Vla and Vlj, than in the spectroscopic 
redshift case where we use Vl /2 as linking criterion. Thus, the new 
velocity linking criteria are by far less strict, and the probability of 
finding two objects as linked is increased. The new velocity link- 
ing criteria are responsible for finding additional links and result in 
deviations of category 3, 5, and 7. In combination with the above 
mentioned scattering of photometric redshifts they can also result 
in category 8 discrepancies. The consideration of the photometric 
redshift errors is the main reason for the obviously larger and more 
numerous structures found in the photometric redshift structure cat- 
alogue. 
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Table 3. Comparison of the LCRS EXT-FOF structure catalogues with and without simulated photometric redshifts 





category 


EXT-FOF 


EXT-FOF 


% EXT-FOF 


% EXT-FOF 






without Sz 


with 5z 


without dz 


with Sz 


total 




1285 


1598 


100 


100 


identical 


1 


230 


230 


17.9 


14.4 


only EXT-FOF without 5z struct. 


2 


94 




7.3 




only EXT-FOF with &z struct. 


3 




646 




40.4 


EXT-FOF without 6z struct, larger than EXT-FOF with &z 


4 


34 


34 


2.7 


2.1 


EXT-FOF with 5z struct, larger than EXT-FOF without 6z 


5 


382 


382 


29.7 


23.9 


EXT-FOF without 5z struct, is combination of EXT-FOF with 5z structs. 


6 








0.0 


0.0 


EXT-FOF with 5z struct, is combination of EXT-FOF without 5z structs. 


7 


229 


92 


17.8 


5.8 


EXT-FOF with and without 5z structs. have some elements in common 


8 


316 


214 


24.6 


13.4 


EXT-FOF without 5z structs. found with EXT-FOF with 5z alg. 




1191 




92.7 





6 SUMMARY 

In this paper we presented a new structure finding algorithm de- 
signed to detect groups and clusters of galaxies in a galaxy cata- 
logue having photometric redshifts. 

Since the development of the photometric redshift technique 
this type of galaxy survey has grown in popularity. The relatively 
large errors in the redshift evaluation pose a problem for a reli- 
able cluster identification, though. One of the most common clus- 
ter finding methods is the friends-of-friends (or FOF) technique. It 
utilises a straightforward approach by looking for overdensities in 
the galaxy distribution and not depending on any model assump- 
tions for clusters, and it proved to yield reliable results. However 
this algorithm is only capable of looking for structures in either 
spectroscopic redshift datasets, or simulated galaxy datasets. We 
thus created a modified algorithm, the extended friends-of-friends 
(or EXT-FOF), based on the original FOF, that is a tailor-made so- 
lution for dealing with the large photometric redshift errors. 

We presented our new algorithm and explained the major dif- 
ferences between the original FOF and the EXT-FOF method. The 
basic working principle of the EXT-FOF being the limitation to 
galaxies for structure finding, that are compatible within their red- 
shift errors with given redshift values. 

The EXT-FOF algorithm was tested on two spectroscopic red- 
shift surveys, the Center for Astrophysics Redshift Survey (CFAl) 
and the Las Campanas Redshift Survey (LCRS). The resulting 
groups were compared to structures found with the original FOF 
in those two surveys. We were able to show that in case of spectro- 
scopic redshift datasets, the FOF and EXT-FOF group catalogues 
were almost identical to each other. The comparison provided a re- 
covery rate of almost 98% and 94% of the FOF structures for the 
CFAl and LCRS cluster catalogues, respectively. A spurious de- 
tection rate of less than 5% and 1% of the EXT-FOF structures was 
found for the CFAl and LCRS cluster catalogues. Both the recov- 
ery and spurious detection rates were based on the assumption that 
the original FOF algorithm yielded a complete structure catalogue. 
All of the discrepancies between the FOF and EXT-FOF group 
compositions were explained. We showed that our new algorithm 
is reluctant to identify an accumulation of galaxies as a cluster if it 
is very elongated along the redshift axis. This is an attribute, that 
makes the EXT-FOF technique very valuable for cluster-finding in 
photometric redshift datasets. 

We furthermore tested our EXT-FOF algorithm on a dataset 
with simulated photometric redshifts that we created by assigning 



Gaussian distributed redshift offsets to the galaxies of the LCRS. 
A comparison of the resulting group catalogue with the one found 
by the EXT-FOF in case of the spectroscopic LCRS proved, that 
the EXT-FOF method yields a conservative cluster catalogue, and 
is very capable of structure finding in photometric redshift galaxy 
datasets. A recovery rate of almost 93% of the spectroscopic red- 
shift structures and a spurious detection rate of about 40% of the 
photometric redshift structures were found. Furthermore, as was 
expected, a general tendency for finding larger groups was discov- 
ered in the photometric redshift case. 

This paper is the first in a series of papers, dealing with the 
search for groups and clusters of galaxies in photometric redshift 
datasets. The next paper will show the application of the new EXT- 
FOF algorithm to the Munich Near-IR Cluster Survey (MUNICS; 
Drory et al. 2001b). 
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